Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles
Identifieur interne : 000908 ( Main/Exploration ); précédent : 000907; suivant : 000909Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles
Auteurs : Mehdi Haji [Canada] ; D. Bui [Canada] ; Y. Suen [Canada]Source :
- Lecture Notes in Computer Science [ 0302-9743 ] ; 2009.
Abstract
Abstract: Document images obtained from scanners or photocopiers usually have a black margin which interferes with subsequent stages of page segmentation algorithms. Thus, the margins must be removed at the initial stage of a document processing application. This paper presents an algorithm which we have developed for document margin removal based upon the detection of document corners from projection profiles. The algorithm does not make any restrictive assumptions regarding the input document image to be processed. It neither needs all four margins to be present nor needs the corners to be right angles. In the case of the tilted documents, it is able to detect and correct the skew. In our experiments, the algorithm was successfully applied to all document images in our databases of French and Arabic document images which contain more than two hundred images with different types of layouts, noise, and intensity levels.
Url:
DOI: 10.1007/978-3-642-04146-4_109
Affiliations:
Links toward previous steps (curation, corpus...)
- to stream Istex, to step Corpus: 000542
- to stream Istex, to step Curation: 000535
- to stream Istex, to step Checkpoint: 000430
- to stream Main, to step Merge: 000916
- to stream Main, to step Curation: 000908
Le document en format XML
<record><TEI wicri:istexFullTextTei="biblStruct"><teiHeader><fileDesc><titleStmt><title xml:lang="en">Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles</title>
<author><name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
</author>
<author><name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
</author>
<author><name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</author>
</titleStmt>
<publicationStmt><idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1007/978-3-642-04146-4_109</idno>
<idno type="url">https://api.istex.fr/document/DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000542</idno>
<idno type="wicri:Area/Istex/Curation">000535</idno>
<idno type="wicri:Area/Istex/Checkpoint">000430</idno>
<idno type="wicri:doubleKey">0302-9743:2009:Haji M:simultaneous:document:margin</idno>
<idno type="wicri:Area/Main/Merge">000916</idno>
<idno type="wicri:Area/Main/Curation">000908</idno>
<idno type="wicri:Area/Main/Exploration">000908</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title level="a" type="main" xml:lang="en">Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles</title>
<author><name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
<affiliation wicri:level="1"><country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author><name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<affiliation wicri:level="1"><country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Canada</country>
</affiliation>
</author>
<author><name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<affiliation wicri:level="1"><country xml:lang="fr">Canada</country>
<wicri:regionArea>Centre for Pattern Recognition and Machine Intelligence, Concordia University, 1455 de Maisonneuve Blvd. West, H3G 1M8, Montreal, Quebec</wicri:regionArea>
<wicri:noRegion>Quebec</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1"><country wicri:rule="url">Canada</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series><title level="s">Lecture Notes in Computer Science</title>
<imprint><date>2009</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
<idno type="eISSN">1611-3349</idno>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3</idno>
<idno type="DOI">10.1007/978-3-642-04146-4_109</idno>
<idno type="ChapterID">109</idno>
<idno type="ChapterID">Chap109</idno>
</biblStruct>
</sourceDesc>
<seriesStmt><idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc><textClass></textClass>
<langUsage><language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en">Abstract: Document images obtained from scanners or photocopiers usually have a black margin which interferes with subsequent stages of page segmentation algorithms. Thus, the margins must be removed at the initial stage of a document processing application. This paper presents an algorithm which we have developed for document margin removal based upon the detection of document corners from projection profiles. The algorithm does not make any restrictive assumptions regarding the input document image to be processed. It neither needs all four margins to be present nor needs the corners to be right angles. In the case of the tilted documents, it is able to detect and correct the skew. In our experiments, the algorithm was successfully applied to all document images in our databases of French and Arabic document images which contain more than two hundred images with different types of layouts, noise, and intensity levels.</div>
</front>
</TEI>
<affiliations><list><country><li>Canada</li>
</country>
</list>
<tree><country name="Canada"><noRegion><name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
</noRegion>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<name sortKey="Bui, D" sort="Bui, D" uniqKey="Bui D" first="D." last="Bui">D. Bui</name>
<name sortKey="Haji, Mehdi" sort="Haji, Mehdi" uniqKey="Haji M" first="Mehdi" last="Haji">Mehdi Haji</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
<name sortKey="Suen, Y" sort="Suen, Y" uniqKey="Suen Y" first="Y." last="Suen">Y. Suen</name>
</country>
</tree>
</affiliations>
</record>
Pour manipuler ce document sous Unix (Dilib)
EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000908 | SxmlIndent | more
Ou
HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000908 | SxmlIndent | more
Pour mettre un lien sur cette page dans le réseau Wicri
{{Explor lien |wiki= Ticri/CIDE |area= OcrV1 |flux= Main |étape= Exploration |type= RBID |clé= ISTEX:DFD938AAD9B41127DA9E2CDF0DDEA95C9B8A83A3 |texte= Simultaneous Document Margin Removal and Skew Correction Based on Corner Detection in Projection Profiles }}
This area was generated with Dilib version V0.6.32. |